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Fast  Computation  in  Adaptive  Tree  Approximation* 


Peter  Binev  and  Ronald  DeVore 


Abstract 

Adaptive  methods  of  approximation  arise  in  many  settings  including  numerical 
methods  for  PDEs  and  image  processing.  They  can  usually  be  described  by  a 
tree  which  records  the  adaptive  decisions.  This  paper  is  concerned  with  the  fast 
computation  of  near  optimal  trees  based  on  n  adaptive  decisions.  The  best  tree 
based  on  n  adaptive  decisions  could  be  found  by  examining  all  such  possibilities. 
However,  this  is  exponential  in  n  and  could  be  numerically  prohibitive.  The  main 
result  of  this  paper  is  to  show  that  it  is  possible  to  find  near  optimal  trees  using 
computations  linear  in  n. 

AMS  subject  classification:  65Y20,  65N50,  41A63,  41A15,  68W40,  68W25. 

Key  Words:  tree  approximation,  adaptive  approximation,  n-term  approximation, 
degree  of  approximation,  fast  algorithms. 


1  Introduction 

Tree  approximation  is  a  form  of  nonlinear  approximation  in  which  the  approximants  are 
required  to  have  a  certain  structure  described  by  trees.  It  was  introduced  and  studied 
in  [2]  in  the  context  of  n-term  wavelet  approximation  and  the  application  of  this  form 
of  approximation  to  image  compression.  Wavelets  are  naturally  indexed  on  the  set  D  of 
dyadic  cubes  which  has  a  tree  structure  determined  by  set  inclusion.  In  this  setting,  tree 
approximation  with  n  terms  seeks  to  approximate  a  given  target  function  /  by  a  linear 
combination  of  n  wavelets  whose  indices  from  D  are  required  to  align  themselves  on  a 
tree.  The  tree  structure  gives  an  efficient  way  to  encode  the  positions  of  the  wavelets 
appearing  in  the  sum.  This  form  of  approximation  is  more  restrictive  than  the  usual 
n-term  approximation  by  wavelets  where  the  n  wavelets  used  in  forming  the  approxima¬ 
tion  are  completely  arbitrary.  Both  of  these  forms  of  approximation  are  nonlinear  since 
the  n-terms  may  depend  on  /.  The  paper  [2]  gives  results  on  the  error  of  n-term  tree 
approximation  and  shows  how  these  can  be  used  to  build  efficient  encoders  for  image 
compression. 

Tree  approximation  also  occurs  naturally  in  the  construction  of  adaptive  methods  for 
approximation.  Given  a  domain  Q  and  a  target  function  /  defined  on  Q,  we  approximate 

*This  work  has  been  supported  in  part  by  the  Office  of  Naval  Research  Contract  N00014-91-J-1343, 
(N00014-91-J-1076),  the  Army  Research  Office  contract  DAAD  19-02-1-0028,  and  the  National  Science 
Foundation  grants  DMS  9970326,  DMS  0221642. 
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/  by  dividing  fl  into  a  partition  P  of  n  cells  and  using  piecewise  polynomials  (of  some 
fixed  degree)  subordinate  to  this  partition.  These  cells  are  obtained  by  some  adaptive 
process  which  decides  when  and  how  a  cell  should  be  divided.  Thus,  the  partition  P  can 
be  described  by  a  tree  which  indicates  the  adaptive  divisions  that  have  occurred. 

The  usual  application  of  this  second  form  of  tree  approximation  is  to  adaptive  numer¬ 
ical  methods  for  PDEs.  This  application  is  quite  different  from  image  processing.  Indeed, 
in  image  processing  based  on  wavelet  expansions,  the  target  function  and  all  of  its  wavelet 
coefficients  can  be  considered  to  be  known  and  the  only  problem  is  how  to  organize  a  good 
approximation  with  the  required  tree  structure.  In  the  PDE  setting  however,  the  target 
function  is  unknown  and  information  about  it  can  only  be  obtained  through  certain  nu¬ 
merical  queries.  This  information  is  usually  given  in  the  form  of  bounds  on  the  local 
approximation  error  through  the  residual.  In  Adaptive  Finite  Element  Methods  (AFEM) 
the  local  error  bounds  are  called  a  posteriori  error  estimators  and  are  a  central  issue  in 
building  good  adaptive  solvers. 

However,  even  in  AFEM,  the  adaptive  approximation  of  known  target  functions  arises 
and  must  be  numerically  handled  in  an  efficient  manner.  For  example,  in  the  numerical 
resolution  of  elliptic  equations,  the  right  hand  side  of  such  an  equation  is  known  but  must- 
be  efficiently  approximated  numerically.  Also  methods  using  coarsening  of  partitions  will 
encounter  a  similar  problem.  In  fact,  our  motivation  for  the  present  paper  occurred  in 
our  study  of  AFEM  and  the  results  of  this  paper  are  used  to  build  a  numerical  AFEM 
with  proven  convergence  rates  in  [1], 

We  could  find  a  best  tree  approximation  to  /  with  n  interior  nodes  by  forming  all  pos¬ 
sible  trees  with  n  interior  nodes  and  examining  the  global  error  associated  to  each  tree. 
We  would  choose  the  tree  which  gives  the  smallest  error  bound.  However,  there  are  expo¬ 
nential  in  n  such  trees  so  this  method  is  computationally  prohibitive  when  n  is  large.  The 
main  point  of  the  present  paper  is  to  show  that  under  appropriate  settings,  it  is  possible 
to  create  near  best  adaptive  n  term  approximations  using  only  O(n)  computations. 

In  §3  we  shall  give  a  precise  description  of  the  setting  for  this  paper  and  formulate  our 
main  results.  In  the  sections  that  follow,  we  shall  introduce  two  algorithms  for  attaining 
the  performance  we  want.  The  first  algorithm  works  under  more  restrictive  assumptions 
than  the  second  but  may  be  more  intuitive  in  its  structure.  In  the  last  two  sections  we 
give  examples  of  how  to  apply  these  algorithms  in  settings  such  as  those  mentioned  above 
for  AFEM. 


2  Adaptive  partitioning 

In  this  section,  we  shall  describe  the  setting  of  adaptive  refinement  that  arises  in  adap¬ 
tive  numerical  methods  for  PDEs.  We  shall  see  that  such  adaptive  partitioning  can  be 
described  by  trees. 

Let  Q  be  a  polygonal  domain  in  lRd .  By  a  partition  P  for  Q  we  shall  mean  a  collection 
of  simplices  A  such  that 

1JA  =  S1,  (2.1) 

A  eP 
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and 


(2.2) 


meas(A  fl  A')  =  0  for  any  pair  of  different  simplices  A,  A'  e  P  , 

where  meas  denotes  Lebesgue  measure  in  lRd . 

The  adaptive  algorithms  of  interest  to  us  are  based  on  subdividing  certain  of  the 
simplices  in  P.  The  choice  of  which  simplices  to  subdivide  is  adaptive  and  based  on  data 
dependent  criteria.  However  the  rule  for  subdividing  the  simplices  is  fixed  in  advance. 
Namely,  we  assume  that  given  any  simplex  A,  this  rule  gives  a  collection  Aj  C  A,  j  = 
1, . . . ,  m( A),  with  m( A)  >  2,  of  simplices  which  partition  A.  The  simplices  A1; . . . ,  Am(A) 
are  called  the  children  of  A  and  A  is  their  parent.  For  example,  in  the  case  d  =  2,  one 
such  rule  might  be  to  subdivide  a  given  triangle  into  four  triangles  by  using  the  bisectors 
to  each  side.  There  are  many  other  possibilities.  However,  we  stress  the  important  fact 
that  the  decision  on  which  cells  will  be  subdivided  is  data  dependent  but  how  they  are 
subdivided  is  not  allowed  to  be  data  dependent  in  the  setting  of  this  paper. 

Let  P  be  a  partition  of  fl.  An  elementary  refinement  P'  of  P  is  the  replacement  of 
one  simplex  A  e  P  by  the  K  new  simplices  Aj,  j  =  1, ...  K .  given  by  the  subdivision  rule. 
All  other  simplices  in  P  remain  untouched. 

An  adaptive  partitioning  consists  of  a  series  of  elementary  refinements.  Starting  with 
an  initial  partition  P0,  a  set  of  simplices  M0  are  marked  for  elementary  refinement.  The 
new  partition  P\  is  obtained  by  replacing  each  marked  cell  by  its  elementary  refinement 
but  leaving  all  other  cells  untouched.  We  continue  with  this  process  of  marking  cells 
and  refining  and  thereby  generate  the  partitions  P| , . . . ,  Prn  such  that  each  Pj  is  the 
refinement  of  Pj-i,  j  =  1, . . . ,  rn.  The  complexity  of  the  final  partition  Pm  is  determined 
by  the  number  of  marked  cells,  i.e.  the  cardinality  of  n  :=  #(U which  is  the  same 
as  the  number  of  elementary  refinements  that  have  taken  place. 

Each  adaptively  generated  partition  P  can  be  represented  by  a  tree  T  =  T(P).  The 
nodes  in  this  tree  T  are  the  simplices  of  Pj,  j  =  0, . . . ,  rn.  The  roots  of  the  tree  are  the 
simplices  A  in  P0.  The  set  C{T)  of  leaves  of  the  tree  T  are  the  simplices  in  the  final 
partition  Prn.  Recall  that  the  leaves  of  a  tree  T  are  the  final  nodes,  i.e.  the  nodes  in  T 
which  have  no  children  in  T. 

The  trees  T(P )  which  arise  in  adaptive  partitioning  are  all  finite  subtrees  of  an  infinite 
tree  P  which  we  call  the  master  tree.  The  tree  T*  has  as  its  nodes  all  of  the  possible 
simplices  A  that  can  arise  in  the  refinement  process.  This  idea  of  a  master  tree  which 
delineates  the  finite  subtrees  that  can  arise  in  an  adaptive  algorithm  occurs  in  more  general 
contexts  such  as  trees  that  arise  in  numerically  based  wavelet  methods.  Therefore,  in  going 
further  in  this  paper,  we  shall  consider  adaptive  approximation  in  this  more  general  tree 
setting  as  described  in  the  following  sections. 

3  Adaptive  tree  approximation 

In  this  section,  we  shall  describe  the  general  setting  of  tree  approximation.  We  assume 
that  we  have  in  hand  an  infinite  tree  T*  (called  the  master  tree )  with  a  finite  number 
of  root  nodes  and  with  the  property  that  each  node  A  of  T*  has  rn  ( A )  children  with 
2  <  m( A)  <  K.  We  allow  the  number  of  children  to  vary  from  node  to  node.  Although 
in  most  applications  m( A)  does  not  depend  on  A. 
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A  subtree  T  of  T*  is  a  collection  of  nodes  A  E  T*  such  that  whenever  A,  A'  are  in 
T  and  A'  ^  A  is  a  descendent  of  A  then  all  of  the  children  of  A  are  also  in  T .  Notice 
that  this  requires  that  whenever  A  is  in  T  then  all  of  the  siblings  of  A  are  also  in  T. 
This  condition,  not  always  assumed  for  a  tree,  is  compatible  with  the  trees  that  arise  in 
adaptive  methods. 

A  subtree  T  is  said  to  be  proper  if  T  contains  all  of  the  root  nodes  of  T*.  The  set  C{T) 
of  leaves  of  T  are  all  of  the  nodes  in  T  for  which  none  of  their  children  are  in  T .  All  other 
nodes  of  T  are  called  interior  nodes.  The  number  of  interior  nodes  of  T  will  be  denoted 
by  N(T).  This  is  also  the  number  of  subdivisions  used  to  create  T.  We  shall  frequently 
use  the  following  remark: 


#(£(T))  <  #(T)  <  2#(£(T)).  (3.1) 

Similarly,  in  the  case  that  all  root  nodes  of  T*  are  interior  nodes  of  T,  we  have 

N(T)  <  #(T)  <  (K  +  1  )N(T)  (3.2) 

and 

#(£(T))  <  K  N(T).  (3.3) 

The  left  inequality  in  (3.1)  is  obvious.  The  right  inequality  follows  by  induction  on  #(T). 
Indeed,  if  we  grow  a  tree  T  by  adding  the  children  of  one  of  its  leaves  A,  then  the  number 
of  nodes  in  T  increases  by  m( A)  and  the  number  of  leaves  grows  by  m( A)  —  1  >  1.  Since 
m( A)  <  2(m(A)  —  1)  we  arrive  at  (3.1).  A  similar  reasoning  gives  (3.2)  and  (3.3).  In 
particular,  one  can  use  that  for  any  tree  T  C  T„ 

#(T)  <  N0  +  I<N(T )  (3.4) 

where  N0  is  the  number  of  root  nodes  of  T*. 

We  assume  that  we  are  given  a  functional  e  which  associates  to  each  node  A  of  2b  a 
nonnegative  real  number  e(A).  Given  any  proper  subtree  T  of  T*  we  define 

E(T)~  Y.  e<A>-  <3'5> 

A &C(T) 

The  reader  should  think  of  e  as  a  bound  for  the  local  approximation  error  and  E  a 
bound  for  the  global  approximation  error  to  a  given  function  /  .  For  example,  in  the  case 
of  piecewise  polynomial  approximation  on  a  simplicial  decomposition,  given  a  function 
/  G  LP(Q),  1  <  p  <  oo,  we  could  define  e(A)  to  be  the  p-th  power  of  the  error  of  best 
Z/p(A)-approximation  to  /  by  polynomials  of  some  fixed  degree  r.  Then  E(T)  is  the  p-th 
power  of  the  global  error  in  approximating  /  in  Lp(Q)  by  piecewise  polynomials  of  degree 
r  with  no  continuity  assumptions  on  the  piecewise  polynomials  across  the  faces  of  the 
simplices  of  C{T).  Another  setting  that  occurs  when  considering  piecewise  polynomial 
approximation  with  continuity  conditions  across  the  boundaries  of  the  simplices  is  that 
e(A)  is  the  p-th  power  of  the  error  of  best  Z/p(A)-approximation  to  /  by  polynomials  of 
some  fixed  degree  r  with  A  a  local  domain  which  contains  A.  The  overlapping  of  the  cells 
A  causes  some  difficulties  which  will  be  overcome  in  the  theory  that  we  develop. 
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Given,  the  functionals  e  and  E.  and  an  arbitrary  integer  n  >  0,  consider  the  class  T„ 
of  all  proper  trees  T  with  N(T )  =  n  and  define 

En:=  min  E{T).  (3.6) 

J-  t  In 

Thus,  En  measures  the  best  error  we  could  obtain  by  utilizing  trees  generated  by  using 
n  subdivisions.  The  main  question  we  wish  to  study  in  this  paper  is  to  what  extent  is 
it  possible  to  find  adaptive  algorithms  which  achieve  approximation  order  like  En(P). 
Since  the  number  of  trees  in  %,  is  exponential  with  respect  to  n,  finding  the  best  tree  T* 
which  achieves  the  error  En  could  be  a  very  costly  operation.  We  shall  show  that  there 
are  constants1  C\ .  C2  >  0  and  numerically  effective  algorithms  to  adaptively  find  trees 
T  E  Tcin  such  that 

E(T)  <  C2En  (3.7) 

where  the  constants  C\  and  C2  depend  only  on  the  integer  K.  We  call  such  an  algorithm 
near  optimal. 

We  shall  measure  the  numerical  cost  of  an  adaptive  algorithm  by  the  number  of  com¬ 
putations  used  in  the  algorithm  where  the  cost  of  computing  e(A)  for  any  given  A  is 
taken  to  be  one.  In  the  next  two  sections,  we  propose  two  algorithms  to  find  a  near  best 
approximant  for  each  n  in  a  numerically  effective  way.  Namely,  the  cost  of  each  of  these 
algorithms  is  linear  in  n.  The  first  algorithm,  given  in  the  next  section,  is  more  restrictive 
in  the  assumptions  it  makes  on  e.  Also,  the  first  algorithm  depends  on  a  thresholding 
parameter  which  is  eliminated  in  the  second  algorithm. 

4  The  first  algorithm 

In  this  section,  we  shall  assume  that  the  functional  e  satisfies  the  following 
Refinement  property:  If  A  is  any  node  of  T*  and  Ai, . . . ,  Am(A)  are  its  children  then 

m(  A) 

£  e(A.)  <  e(A).  (4.1) 

i= 1 

Our  first  algorithm  will  be  based  on  a  threshold  parameter  t  >  0.  Let  T*  be  a  master 
tree  and  let  e  be  a  functional  defined  on  the  nodes  of  T*  satisfying  (4.1)  and  let  E  be 
defined  by  (3.5).  The  basic  idea  of  the  first  algorithm  is  to  subdivide  those  nodes  A 
for  which  e(A)  exceeds  the  threshold  t.  But  this  procedure  may  fail  since  we  are  not 
guaranteed  that  such  subdivisions  actually  effectively  decrease  the  error.  Thus,  we  shall 
modify  the  adaptive  criteria  to  include  a  penalty  term. 

The  intuitive  idea  behind  the  first  algorithm  can  be  explained  by  considering  the 
functional  e  as  specifying  a  certain  energy  at  each  node.  The  inequality  (4.1)  says  that 
the  energy  decreases  (strictly  speaking  does  not  increase)  when  a  node  is  divided  into  its 
children.  We  shall  modify  e  to  obtain  a  new  functional  e  in  order  to  guarantee  that  the 
modified  energy  e  lost  in  such  a  subdivision  is  at  least  t. 

xWe  shall  denote  constants  by  the  generic  C  and  they  may  vary  at  each  occurrence.  Constants  that 
are  important  in  going  further  in  the  paper  will  be  denoted  by  Ci,  c±,  C2,  ■  ■  ■■ 
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Given  A  6  T„  we  let  S( A)  denote  the  set  of  its  siblings  (i.e.  the  collection  of  all 
A'  £  T  which  have  the  same  parent  as  A).  The  quantity 


A(A)  :  = 


e(A) 

X^A'eS'(A)  e(A') 


(4.2) 


will  be  used  in  the  case  the  denominator  is  positive.  It  measures  the  portion  of  the  energy 
in  S( A)  that  is  carried  by  A.  Note  that  X^A'es(A)  -MA')  =  1  for  each  A. 

Let  t  >  0  be  the  thresholding  parameter.  For  each  A  £  T*  with  children  Ai, . . . ,  Am(A), 
we  define 

m(  A) 

d( A)  :=  e(A)  -  £  e(A.)  =:  e(A)  -  <r(A)  (4,3) 

i= 1 

and 

(  0,  if  d( A)  >  t, 

6(A)  :=  {  (4.4) 

(  t  —  d( A),  if  d( A)  <  t. 

The  quantity  5(A)  is  the  amount  of  artificial  energy  we  will  take  away  at  each  node 
subdivision. 

With  the  above  notation  in  hand,  we  now  introduce  a  modified  functional  e  defined 
on  the  nodes  of  T*  by  an  expression  of  the  form 


e(A)  =  e(A)  —  ct(A),  (4.5) 

We  define  ct(A)  =  0  for  all  root  nodes  and  whenever  a  (A)  has  been  defined,  then  we 
define  ck(A')  for  each  child  A'  of  A  by: 

a(A')  :=  A(A')q(A)  +  A(A')5(A).  (4.6) 

The  first  term  in  the  definition  of  a  ( A )  represents  the  portion  of  the  artificial  energy 
already  removed  that  needs  to  be  attributed  to  this  node  and  the  second  term  is  the  new 
energy  to  be  removed  because  of  the  subdivision  of  A.  Let  us  observe  for  later  use: 

Remark  4.1  If  A'  is  a  child  of  A  then 

e(A')  =  e(A')  -  a(A')  =  e(A')(l  -  (4-7) 

so  that  if  one  of  the  children  A'  of  A  has  e(A')  positive  then  all  other  children  A"  will 
have  e(A")  nonnegative. 

The  first  algorithm  will  recursively  generate  trees  To. ... .  Tp,  by  growing.  The  tree 
T0  consists  of  the  set  of  root  nodes  of  T*.  The  remaining  Tk+i  are  generated  from  the 
previous  To  by  using  the  following  criteria. 

FIRST  ALGORITHM:  For  each  A  £  T(Tk),  compute  e(A).  Whenever  e(A)  >  t, 
we  add  all  of  the  children  of  A  to  Tk.  Then  Tk+1  consists  of  Tk  together  with  all  of 
these  children.  The  algorithm  terminates  when  Tk+ 1  =  Tk,  i.e.  when  e(A)  <  t  for  all 
A  £  C(Tk).  The  final  tree  is  denoted  by  T. 
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The  most  important  property  of  this  algorithm  is  that  the  sum  of  modified  errors  e 
looses  locally  a  quantity  of  t  each  time  a  node  is  subdivided  (see  (4.11)  bellow).  To  analyze 
FIRST  ALGORITHM,  we  shall  need  the  following  lemmas. 

Lemma  4.2  Let  t  >  0  and  T  be  the  tree  associated  to  the  FIRST  ALGORITHM.  If 
A  is  a  node  in  the  interior  of  T  and  Ta  is  the  subtree  consisting  of  all  A'  E  T  such  that 
A'  is  a  descendent  of  A,  then 

IffA  <  JV(Ta)t  <  8(A)  <  e(A).  (4.8) 

Proof:  The  left  inequality  in  (4.8)  is  (3.2)  and  the  right  inequality  is  obvious  since  ct(A) 
is  nonnegative.  Therefore,  we  need  only  prove  the  second  inequality.  From  (4.5)  and 
(4.6),  it  follows  that  for  each  A'  E  T*  with  the  children  A'1; . . . ,  A^a,),  we  have 

m(A')  m(A') 

£  S(A')  =  £  e(A')  -  a(A')  -  5(A')  (4.9) 

i— 1  i— 1 

and  therefore 

m(  A')  m(A') 

e(A')-  £  e(A')  —  e(A')  —  a(A')  —  £  e(A')  +  Q(A')  +  ,5(A')  =  d(A')  4-<5(A').  (4.10) 

2  —  1  2  —  1 

Since  d(A')  +  5(A')  >  t,  we  obtain 

m(  A') 

£  e(AJ)  <  e(A')  -  t  (4.11) 

2—  1 

In  other  words,  each  subdivision  reduces  the  energy  e  by  at  least  an  amount  t. 

We  define  T0  to  be  the  subtree  of  Ta  obtained  by  deleting  from  Ta  all  leaves  A'  E 
T(Ta)  for  which  it  and  all  of  its  siblings  are  in  T(Ta). 

If  we  apply  (4.11)  starting  at  the  bottom  of  T0  and  moving  to  the  top  root  A,  we 
obtain 

]T  e(A')  +  tN (T0)  <  e(A).  (4.12) 

A'e£(T0) 

The  sum  on  the  left  side  of  (4.12)  is  over  two  types  of  A'.  The  first  are  those  A'  which 
are  interior  nodes  in  Ta-  Since  these  are  further  subdivided  by  the  algorithm,  they  satisfy 
e(A')  >  t.  There  are  precisely  N(T \)  —  N( T0)  nodes  of  this  type.  The  second  type  of 
node  satisfies  e(A')  >  0  because  of  Remark  4.1.  It  follows  that  the  sum  on  the  left  side 
of  (4.12)  is  >  ^(A^(Ta)  —  A(T0)).  When  this  is  used  in  (4.12),  we  arrive  at  the  second 
inequality  in  (4.8).  □ 

Lemma  4.3  Let  t  >  0  and  let  T  be  the  tree  associated  to  the  FIRST  ALGORITHM. 
If  A  is  a  root  node  ofT  and  Ta  is  any  subtree  of  T  with  single  root  node  A,  then 

£  e(A')<  £  e(A')  +  N(TA)t.  (4.13) 

A'e£(TA)  A'££(Ta) 
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Proof:  If  N(TA)  =  0,  that  is  Ta  =  {A}  then  (4.13)  is  obvious  because  e(A)  =  e(A). 
Therefore,  in  going  further,  we  shall  only  consider  the  case  where  N(TA)  >  1.  From  the 
definition  of  e  in  (4.5),  we  have 

e(A')  =  e(A')  +  5]  a(A').  (4.14) 

A'e£(TA)  A'6U(Ta)  A'e£(TA) 

We  complete  the  proof  by  showing  that 

a(A')  <  N(TA)t  (4.15) 

A  'e£(TA) 

by  using  induction  on  N(TA).  In  the  trivial  case  N(TA )  =  0  we  have  jC(Ta)  =  {A}  and 
e(A)  =  e(A).  When  N(TA )  =  1,  TA  consists  solely  of  A  and  its  children  and  each  of 
these  children  is  a  leaf  of  TA.  Since  ct(A)  =  0  and  for  each  of  the  children  A'  of  A,  we 
have 

a(A')  =  A(A')(a(A)  +  5(A))  <  tX(A') 

because  5(A)  <  t.  Summing  over  A'  and  using  that  ^A,  A(A')  =  1,  we  arrive  at  (4.15) 
in  this  case. 

Suppose  now  that  (4.15)  holds  for  all  trees  TA  with  N(TA )  =  k  and  consider  a  tree 
Ta  with  N(Ta )  =  k  +  1.  Let  A'  be  a  node  whose  children  Aj, . . . ,  A'm^A^  are  leaves  of 
Ta.  The  tree  TA  which  is  obtained  from  TA  by  deleting  all  of  the  children  of  A'  satisfies 
N(T'a )  =  k  and  hence  (4.15)  holds  for  TA.  We  need  only  show  that 

m(  A') 

E  a(A')<a(A')+t  (4.16) 

i= 1 

But  ct(A')  =  A(A')(ct(A/) +  5(A'))  and  S(A')  <  t.  Since  ^  A(A();  =  1,  we  have  (4.16) 
and  have  completed  the  proof  of  the  lemma.  □ 

Theorem  4.4  Let  t  >  0  be  any  threshold  parameter  for  which  the  FIRST  ALGO¬ 
RITHM  gives  a  final  tree  T.  Then, 

E(T)  <  2(K  +  l)Em  (4.17) 

where  m  :=  [N(T)/2].  The  algorithm  uses  at  most  #(T)  <  N0  +  K  N(T)  computations 
of  e  in  generating  T . 

Proof:  The  statement  concerning  the  number  of  computations  of  e  is  clear  from  (3.4). 
To  prove  (4.17),  we  let  T*  be  a  tree  of  order  m  which  satisfies  E(T*)  =  Em.  We  consider 
three  disjoint  sets  Ao,A!,A2  of  root  nodes.  The  first  set  A0  consists  of  all  root  nodes 
that  are  not  subdivided  in  either  T  or  T*.  The  second  set  A,  consists  of  all  root  nodes 
which  are  interior  nodes  in  T*  but  are  not  subdivided  in  T.  The  last  set  A2  consists  of  all 
remaining  root  nodes;  each  of  these  root  nodes  is  an  interior  node  in  T.  For  each  node  A, 
we  let  Ta  be  the  set  of  all  A '  G  T  that  are  descendants  of  A  (such  a  tree  may  be  empty, 
if  A  ^  T).  We  have  that 

E(T)  =  J2  e(A)  +  e(A)  +  £  £  e(A').  (4.18) 

AeA0  AeAl  AeA2  A'e£(TA) 
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To  estimate  E(T),  we  set  E( A0)  :=  X^AeA0  e(A)  f°r  the  first  sum  (4-18)  and  use  Lemma 
4.3  to  each  Ta  in  the  other  two  sums  to  find 

E(T)  <  B(A„)  +  ^  e(A')  +  Y  Y  S(A')  +  JV '(T)t  (4.19) 

A'eAl  AgA2  A'g£(Ta) 

<  E(A0)  +  #(Ai)t  +  (iF  +  l)iV(T)£  , 

where  the  last  inequality  uses  the  fact  that  e(A')  <  t  for  each  A'  £  £(T)  and  that  the 
total  number  of  elements  in  Uaga2T(Ta)  can  not  exceed  K  N(T )  because  each  such  A' 
is  a  child  of  an  interior  node  of  T. 

To  estimate  Em  from  bellow  we  consider  subtrees  Ta  for  nodes  A  £  C{T*)\ A0  which 
are  not  descendants  of  a  root  node  from  Ai.  We  apply  Lemma  4.2  to  each  such  Ta  and 
then  sum  over  A  to  obtain  (note  that  N(T \)  =  0  for  all  other  A  £  C(T*)) 

Em=  Y  e(A)>E(A0)+  Y  N(T^)t  (4-20) 

A  ec(t*)  /\ec(T*) 

The  fact  that  any  interior  node  of  T,  which  is  not  an  interior  node  for  T*,  will  be  in  one 
of  the  Ta  gives  that  the  sum  in  (4.20)  is  at  least  N(T)  —  ( N(T *)  —  #(Ax)).  From  the 
definition  of  m  we  have  N(T)  >  2 N(T*),  and  therefore 

E( A0)  +  #(A ,)t  +  N(T)t/ 2  <  E( A0)  +  (N(T)  -  (N(T*)  -  #(A0))  t  <  Em  .  (4.21) 

A  combination  of  (4.19)  and  (4.21)  gives  (4.17).  □ 

5  The  second  algorithm 

In  this  section,  we  present  a  second  algorithm  for  adaptive  tree 
advantage  is  that  it  weakens  the  subadditivity  condition  (4.1) 
following  subadditivity  assumption: 

Subadditivity:  For  each  A  £  T*  and  any  finite  tree  Ta  C  T* 
we  have 

Y  e(A')  <  C„e( A)  (5.1) 

A  ’ec(rA) 

with  C0  >  1  an  absolute  constant. 

This  will  be  important  in  Finite  Element  applications  where  the  error  functional  e  cor¬ 
responds  to  local  approximation  on  sets  A  which  overlap.  We  shall  actually  show  in  §7 
that  this  subadditivity  condition  can  be  weakened  even  further  so  as  to  be  applicable  in 
general  finite  element  settings. 

As  was  the  case  in  the  first  algorithm,  we  shall  introduce  a  modified  error  functional 
e  which  will  drive  the  algorithm.  Initially,  for  all  of  the  root  nodes  of  T*,  we  define 
e(A)  =  e(A).  Now,  assuming  that  e(A)  has  been  defined  for  a  A  £  T*,  we  define  e  for 
each  child  Aj,  j  =>  1, . . . ,  m( A),  of  A  as 

e(A,):=,(A)  (5.2) 


approximation.  Its  main 
by  replacing  it  with  the 

with  single  root  node  A, 
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where 


g(A)  := 


ra(  A) 

E  e<Ai) 

e(A)+e(A)e(A)' 


(5.3) 


In  other  words,  e  is  constant  on  the  children  of  A.  It  is  useful  to  define  the  penalty  terms 


P(A,)  :  |A|  ,5.4) 

which  describes  how  e  differs  from  e.  Note  that  the  p{Aj)  satisfy  the  equality 

ra(  A) 

p{Aj)  =  p{A)  +  1 .  (5.5) 

3  = 1 

which  is  the  main  property  we  shall  need  for  e. 

Let  us  note  that  it  follows  from  (5.5)  that  for  any  proper  subtree  T  C  T*  we  have 


^  p(A)  =  N0  +  N(T)  (5.6) 

A  eC(T) 

where  N0  is  the  number  of  root  nodes  of  T*  and  N(T)  is  as  before  the  number  of  subdivi¬ 
sions  used  to  create  T.  This  follows  by  induction  on  N(T).  Similarly  if  Ta*  is  a  tree  with 
a  single  root  node  A*  then 


p(A)=p(A-)  +  JV(Ta.).  (5.7) 

As£(Ta.) 

SECOND  ALGORITHM:  This  algorithm  creates  a  sequence  of  trees  T  =  Tj,  j  = 
1,2,...  as  follows.  For  j  =  0,  T0  is  the  collection  of  root  nodes  of  T If  Tj- 1  has  been 
defined,  we  examine  all  leaves  of  Tj_i  and  subdivide  the  leaves  A  with  largest  e(A)  to 
produce  Tj.  (In  the  case  there  are  several  largest,  we  subdivide  all  of  them.) 

Remark  5.1  Note  that  #(Tj)  <  Kff(Tj_i)  for  each  j  =  1,  2, . . . 

The  following  theorem  which  is  the  analogue  of  Theorem  4.4,  analyzes  the  performance 
of  the  second  algorithm. 

Theorem  5.2  There  is  an  absolute  constant  C  >  0  such  that  for  each  j  =  0,1,..  .r  the 
output  tree  T  =  Tj  of  the  SECOND  ALGORITHM  satisfies 

E(T)  <  CEm  (5.8) 

whenever  m  <  n/(2K  +  2)  and  n  :=  N(T).  To  create  T,  the  algorithm  uses  <  C(n  +  N0 ) 
arithmetic  operations  and  computations  of  e,  where  N0  is  the  number  of  root  nodes  of  T* . 
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Proof:  The  proof  of  this  theorem  has  many  aspects  in  common  with  Theorem  4.4.  We 
consider  any  j  =  0, 1, . . .  such  that  T  =  T)  satisfies  N(T)  =:  n  >  2 (K  +  1)  and  we  let 
T*  be  the  best  tree  with  N(T*)  =  m  with  m  <  n/(2K  +  2).  We  shall  also  assume  that 
each  root  node  has  been  subdivided  in  the  creation  of  either  T  or  T*.  If  this  is  not  the 
case,  one  would  derive  the  theorem  for  the  new  master  tree  obtained  by  deleting  the  root 
nodes  not  divided  and  then  derive  the  result  for  the  original  tree  from  this. 

Let  t  be  the  smallest  value  attained  by  e  on  the  interior  nodes  of  T.  We  let  A  := 
{A  :  e(A)  =  t\  be  the  collection  of  interior  nodes  of  T  where  e  takes  on  the  value  t. 
We  derive  an  upper  bound  for  E(T)  =  X^Ae£(T)  e(A)  by  breaking  T  into  T  =  T0  U  Ti 
where  Ti  :=  Uaga^a  and  for  each  A,  Ta  is  the  subtree  of  T  consisting  of  A  and  all  of  its 
descendants  in  T.  The  tree  T0  is  obtained  by  deleting  all  the  proper  descendants  of  all 
of  the  A  G  A  (i.e.  we  leave  each  of  the  A  G  A  in  T0  but  remove  all  of  their  descendants). 
It  follows  that  C(T)  =  £0  U  L\  where  =  £(Tx)  and  C0  =  C( T0)  \  A.  This  gives 

B(T)=  £  e(A)  +  £  e(A)**£0  +  E,.  (5.9) 

Aec0  AeCi 

We  will  bound  each  of  the  two  sums  appearing  on  the  right  side  of  (5.9).  First,  we 
note  that  from  the  subadditivity  (5.1),  it  follows  that  for  each  A  G  A,  we  have 


£  e(A')  <  Coe  (A)  =  C0e(A)p(A). 

(5.10) 

A  '€C(Ta) 

Therefore, 

£.  =  E  E  e(A')  <  C„  E  e(A)p(A). 

(5.11) 

AeAA  ’eC(TA)  AeA 

Since  E0  =  X^ag£0  ^(A)p(A),  we  have 

Eo  +  IAEC'o  £  e(A)p(A). 

(5.12) 

AEjC(To) 

We  next  want  to  note  that 

e(A)  <t,  A  G  £(T0). 

(5.13) 

This  is  the  case  for  any  A  G  A  by  the  very  definition  of  t  (note  that  any  A  G  A  is  a  leaf  of 
T0).  To  see  (5.13)  for  any  other  leaf  of  T0  (i.e.  those  not  in  A)  consider  the  state  of  affairs 
when  all  of  the  A  G  A  have  already  been  generated  by  the  algorithm  and  the  A  G  A  are 
to  be  subdivided.  Let  V  C  T  be  the  tree  representing  the  state  of  the  algorithm  at  this 
point.  We  claim  that  any  A  G  £(T0)  that  is  not  in  A  is  already  a  leaf  of  V  and  satisfies 
e(A)  <  t.  In  fact,  if  e(A)  >  t,  then  we  would  be  subdividing  A  and  not  the  elements  in 
A.  So  e(A)  <  t  (it  cannot  be  equal  to  t  since  otherwise  A  would  be  in  A)  and  by  the 
definition  of  t,  A  cannot  be  an  interior  node  to  T  and  therefore  must  be  a  leaf  of  T.  Thus, 
we  have  verified  (5.13).  Using  this  in  (5.12)  we  obtain 

E(T)  =  S0  +  Si  <  U0  J]  e(A)p(A)<C0t(N0  +  N(T0))<C0t(N0  +  N(T))  (5.14) 

Ae£(T0) 

where  we  have  used  (5.6)  in  the  next  to  last  inequality. 


11 


We  shall  now  prove  that  t(N0  +  N(T ))  <  CEm.  Let  C*  be  the  collection  of  all  leaves 
A*  £  C(T*)  which  are  in  the  interior  of  T  and  for  each  of  these  A*  let  Ta*  be  the  tree 
consisting  of  all  A  £  T\C(T )  such  that  A  is  a  descendant  of  A*.  For  each  leaf  A  £  T(Ta* ), 
we  have  A  is  in  the  interior  of  T  and  therefore  e(A)  >  t  by  the  very  definition  of  t.  So 
we  can  apply  (5.f)  to  find 

t  P (A)  <  P(A)e(A)  =  e(A)  -  C'oe(A*).  (5.15) 

A  G£(TA»)  A  G£(Ta»)  Ae  £(ta») 

We  now  sum  (5.15)  over  all  A*  £  C*  and  use  (5.7)  to  obtain 

tM  <  t  E  E  p(A)  <  C0  J2  e(A*)<C0Em  (5.16) 

A*e£*  Ag£(TA»)  A *<e£* 

where 

M:=  J2  NF*).  (5.17) 

A*ec* 

To  conclude  the  proof,  we  shall  show  that 

N0  +  N(T)  <  2(K  +  2)M  .  (5.18) 

First,  let  T'  be  the  tree  obtained  from  T  by  deleting  all  of  the  leaves  of  T.  Then,  T' 
contains  Ta*  for  all  A*  £  C* . 

In  the  trivial  case  N(T')  <  N0  we  use  (3.4)  to  receive 

n  =  N(T)  =  #(T')  <  N0  +  K  N(Tr)  <  (K  +  1)A0  (5.19) 

and  therefore  m  <  n/(2K  +  2)  <  N0/2.  Hence  there  are  at  least  N0  —  m  >  N0/ 2  root 
nodes  of  T*  that  are  not  subdivided  in  T*  and  according  to  our  assumption  they  are 
subdivided  in  T.  Thus  M  >  N0/ 2  and  returning  to  (5.19),  we  obtain  (5.18). 

In  the  case  N0  <  N(T')  from  (3.4)  we  know  that 

N0  +  N(T)  =  N0  +  #(T')  <  N0  +  N0  +  K  N{T)  <  {K  +  2 )N(T').  (5.20) 

Finally,  we  claim  that 

N(T')  <  2 M  (5.21) 

which  will  complete  the  proof  of  (5.18)  in  this  case.  To  see  (5.21),  we  note  that  the  only 
interior  nodes  of  T'  that  are  not  interior  nodes  in  one  of  the  Ta*  are  those  which  are 
interior  nodes  in  T*.  This  means  that  there  are  at  most  N(T*)  <  rn  of  them.  Hence, 

N(T')  <  M  +  m<  M  +  <M  +  N°  +  ^  N^T  ^  <  M  +  JV(T') /2  (5.22) 

2  K  +  2  2  K  +  2 

where  we  have  used  the  property  N(T)  =  n  >  (2 K  +  2 )m  of  the  theorem,  and  (3.4).  □ 

Remark  5.3  In  the  SECOND  ALGORITHM  the  use  of  the  largest  e(A)  may  require 
a  sorting  procedure  with  complexity  0{N  log  N) .  To  keep  the  number  of  operations  of 
order  0(N )  we  can  use  binary  bins  instead  of  sorting.  Then  after  e(A)  is  calculated,  we 
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determine  an  integer  k  such  that  2K  <  e(A)  <  2K+1,  and  place  A  into  a  bin  with  index  k. 
We  now  choose  for  subdivision  the  leaves  from  the  bin  with  the  largest  possible  index  and 
proceed  as  in  the  SECOND  ALGORITHM.  Theorem  5.2  remains  valid  for  this  variant 
of  the  algorithm.  The  proof  is  the  same  with  the  following  modifications.  We  define  t  =  2K 
where  k  is  the  smallest  index  of  a  bin  which  contains  an  interior  node  ofT.  The  set  A  is 
now  the  collection  of  interior  nodes  ofT  which  belong  to  that  bin.  Then  t  <  e(A)  <  2 1  for 
A  E  A,  and  subsequently  e(A')  <  2 1  for  A'  E  V .  The  only  other  changes  are  to  replace  t 
with  2 1  in  the  upper  estimates  (5.13)  and  (5.1  f). 

Most  numerical  implementations  of  adaptive  algorithms  such  as  those  that  we  have 
given  would  have  the  goal  of  producing  a  tree  T  such  that  E(T )  <  //  where  p  >  0  is  some 
prescribed  tolerance.  In  the  case  of  SECOND  ALGORITHM  this  can  be  accomplished 
simply  by  introducing  an  error  check  after  each  step  of  the  algorithm. 
THRESHOLDING  SECOND  ALGORITHM:  Given  an  error  tolerance  p  >  0,  this 
algorithm  produces  a  tree  such  that  E(T <  p  as  follows. 

(i)  Compute  e(A)  =  e(A),  A  E  T0,  and  then  E(T0)  for  T0  the  set  of  root  nodes  ofT*. 
Define  T  =  T0  and  proceed  to  step  (ii). 

(ii)  If  E(T)  <  p,  then  define  :=  T  and  stop  the  algorithm.  If  E(T)  >  p,  then  proceed 
to  step  (in). 

(in)  Given  a  proper  subtree  T  ofT*,  compute  p  :=  maxA'6jc(T)  e(A')  and  subdivide  all 
A  E  jC(T)  for  which  e(A)  =  p  thereby  obtaining  the  new  tree  T' .  Redefine  T  to  be  T'  and 
proceed  to  step  (ii). 

Corollary  5.4  There  are  absolute  constants  C\,c\  >  0  such  that  for  any  error  tolerance 

p  >  0,  the  tree  produced  by  the  THRESHOLDING  SECOND  ALGORITHM 

has  the  properties: 

(i)  If  T  dT*  is  any  tree  satisfying  E(T)  <  cip  then 

N(T„)  <  CiN(T).  (5.23) 


(ii)  The  number  of  evaluations  of  e  and  the  number  of  arithmetic  operations  in  producing 
T u  is<Cl(N0  +  N(Tli)). 

Proof:  Statement  (ii)  is  clear.  To  prove  (i),  we  take  c\  =  A ( C  +  l)-1  where  C  is  the 
constant  in  Theorem  5.2  and  A  E  (0, 1/4)  is  a  constant  which  depends  only  on  K  and  will 
be  prescribed  in  the  course  of  the  proof.  Let  T'  be  the  tree  before  the  last  subdivisions  are 
made  to  produce  T  :=  TjL.  Then,  E{T')  >  p.  We  define  T*  as  the  best  tree  approximation 
with  m  :=  N(T*)  =  N(T)  subdivisions.  We  shall  consider  three  cases: 

CASE  1:  E(T)  >  A p.  We  use  Theorem  5.2  for  T.  Since 

Em  =  E(T*)  <  E(T )  <  Clp  <(C+  1  )“1E(T), 

this  theorem  says  that  m  >  N[T)/(2K  +  2)  which  proves  (i)  in  this  case. 

CASE  2:  N(T' )  >  A N(T).  In  this  case,  we  start  with  the  inequality 

E{T*)  <  E(T)  <  ci p  <  ci E(T')  <  (C  +  l)~lE(T'). 


13 


We  now  use  Theorem  5.2  for  T'  and  deduce  that  m  >  N(T') / (2 K  +  2)  >  XN(T) / (2 K  +  2) 
which  proves  (i)  in  this  case. 

CASE  3:  E(T )  <  Xp  and  N(T')  <  X N(T).  This  case  is  the  most  complicated.  Let  A 
be  the  set  of  nodes  in  T0  which  were  subdivided  in  producing  T  from  T'  and  let  D  be 
the  set  of  all  other  nodes  in  T'  which  were  subdivided  in  producing  T.  We  further  set 
a  :=  #(A),  b  :=  #(P).  For  each  A  G  A  U  B  we  have  e(A)  =  p  :=  maxAeg(T')  e(A).  We 
also  have  N(T ")  >  K~lb  and  N(T')  <  X N(T)  =  A (b  +  a  +  N(T'))  which  implies  that 

b  <  2 A Ka  (5.24) 

provided  —  X  >  which  is  true  whenever  A  is  sufficiently  small  (depending  only 
on  K). 

We  next  want  to  show  that  Eg  :=  e(^)  smaF  compared  with  :=  e(^)- 

For  this,  we  introduce  Tg  which  is  the  subtree  of  T'  consisting  of  D  and  all  of  its  ancestors 
and  Nb  which  is  the  number  of  root  nodes  of  Tg.  We  use  (5.7)  to  find 

Eg  =  ^  p(A)e(A)  =  p(NB  +  N(Tb ))  <  2 N(TB)p  <  4 bp,  (5.25) 

A  eB 

where  the  last  inequality  used  (3.1).  On  the  other  hand, 

S a  =  Y^  e(A)  =  =  ap ’  (5-26) 

AeA  AeA 

because  e(A)  =  e(A)  for  root  nodes.  From  (5.24),  we  obtain 

Eg  <  Abp  <  8XKap  <  8XKXA.  (5.27) 

Since 

Sa  +  Eg  >  E(T')  -  E(T)  >  (1  -  X)p,  (5.28) 

(5.27)  implies  that 

Ea  >  p/2  (5.29) 

provided  we  choose  A  small  enough  (depending  only  on  K).  We  now  fix  A  =  1/(8 AT  +  2) 
to  satisfy  the  above  restrictions.  Since  E(T*)  <  c i  //  <  A//  <  p/ 4.  at  least  one  half  of  the 
nodes  in  A  cannot  be  in  £(T*)  and  therefore  were  subdivided  in  producing  T* .  That  is 
a/2  <  N(T *)  =  N(T).  On  the  other  hand, 

(1  -  A )N(T)  <  N(T)  -  N(T')  =  a  +  b  <  a(l  +  2XI<)  <  2(1  +  2A K)N(T)).  (5.30) 

This  proves  (i)  in  this  final  case  and  completes  the  proof  of  the  corollary.  □ 

6  Piecewise  polynomial  approximation 

The  remainder  of  this  paper  will  be  devoted  to  giving  examples  of  settings  where  the  above 
algorithms  can  be  employed.  In  this  section,  we  give  our  first  example,  which  is  quite 
simple.  We  suppose  that  Pq  is  an  initial  triangulation  of  a  polygonal  domain  Q  C  lRd 
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and  we  have  in  hand  a  specific  subdivision  rule  that  tells  us  how  to  subdivide  any  given 
triangle  that  arises  during  a  subdivision  process  (  i.e.  we  have  the  master  tree  T*  for  P0 
and  this  subdivision  rule).  We  consider  the  problem  of  approximating  a  given  function 
/  e  LP(Q),  1  <  p  <  oo,  by  piecewise  polynomials  of  degree  r  >  0  on  partitions  P  which 
correspond  to  subtrees  of  T*.  Let  Lfi  denote  the  space  of  polynomial  of  degree  r.  We 
shall  assume  there  is  no  continuity  restrictions  on  these  piecewise  polynomials  across  the 
boundaries  of  the  simplicial  cells  which  make  up  P. 

We  fix  1  <  p  <  oo  and  /  6  Lp( Q)  and  define 

e(A):=  mfJ|/-P|rMA),  A6t  (6.1) 

Corresponding  to  this  e,  for  any  tree  T  C  T*,  we  define 

£(T):=  V  e(A).  (6.2) 

As  C(T) 

We  recall  that  the  trees  T  C  T*  are  in  one  to  one  correspondence  with  adaptively  generated 
partitions  P.  Namely,  the  leaves  C(T)  form  a  partition  P  =  P(T)  which  is  obtained  by 
applying  N(T)  subdivisions.  Let  us  denote  by 

E(P)  :=  E(T(P ))  (6.3) 

which  is  the  p-th  power  of  the  error  in  approximating  /  in  the  Lp(fl)  norm  by  piecewise 
polynomials  of  degree  r  on  the  partition  P.  Let  us  further  denote  by  Vn  the  set  of  all 
partitions  P  which  can  be  generated  by  at  most  n  subdivisions  of  P0. 

Because  of  the  disjoint  supports  of  the  triangular  cells  A  in  C(T)  and  the  set  subad¬ 
ditivity  of  ||  •  \\pLp,  we  have  that  condition  (4.1)  holds.  This  means  we  can  employ  the  first 
algorithm  of  §4.  Thus,  we  can  apply  either  the  first  or  second  algorithm  in  this  setting. 
Either  of  Theorem  4.4  or  Theorem  5.2  gives 

Theorem  6.1  If  f  e  LP(Q),  then  the  FIRST  ALGORITHM  of  §4  and  the  SECOND 

ALGORITHM  o/§5  when  applied  with  e  of  (6.1)  both  generate  a  tree  T  whose  leaves 
C{T)  form  a  partition  P  =  P(T)  such  that 

E(P)  <  C\  inf  E(P'),  (6.4) 

P  £.Pm 

whenever  m  <  C\N(T),  where  Ci,ci  >  0  are  absolute  constants. 

7  Piecewise  linear  approximation  with  continuity  across 
boundary  edges 

In  this  section,  we  want  to  show  how  to  apply  the  principles  of  the  second  algorithm  in 
a  typical  Finite  Element  application.  We  suppose  that  Po  is  an  initial  triangulation  of 
a  polygonal  domain  C  IR 2  and  we  have  in  hand  a  specific  subdivision  rule  that  tells 
us  how  to  subdivide  any  given  triangle  that  arises  during  a  subdivision  process  (  i.e.  we 
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have  the  master  tree  T*  for  P0  and  this  subdivision  rule).  There  are  two  new  ingredients 
that  we  wish  to  incorporate  into  our  analysis.  The  first  is  the  appearance  of  hanging 
nodes.  Given  a  partition  P  that  is  a  refinement  of  P0,  we  let  Vp  denote  the  collection  of 
all  vertices  of  the  triangles  which  make  up  P.  Such  a  vertex  v  is  called  a  hanging  node 
for  A  G  P  if  it  appears  in  the  interior  of  one  of  the  edges  of  A.  We  shall  say  that  a 
partition  P  is  admissible  if  it  has  no  hanging  nodes.  A  completion  P  of  a  partition  P  is 
an  admissible  partition  which  is  a  refinement  of  P  (using  the  specified  subdivision  rule). 

The  second  ingredient  which  will  cause  us  some  difficulty  is  that  the  typical  error 
functionals  e  which  arise  in  Finite  Element  Applications  will  typically  not  satisfy  our 
condition  (5.1).  The  reason  for  this  is  that  the  Galerkin  method  requires  that  the  approx- 
imants  have  some  smoothness  which  usually  implies  at  least  continuity  of  the  piecewise 
polynomials.  Therefore  the  piecing  together  of  local  approximants  to  obtain  a  good  global 
approximation  prevents  e(A)  from  being  completely  localized  to  A  and  as  a  result  (5.1) 
will  typically  fail  to  hold. 

We  shall  show  how  to  circumvent  these  difficulties  in  this  section.  We  shall  limit  our 
discussion  to  one  specific  subdivision  rule  (newest  vertex  bisection)  which  is  discussed 
and  utilized  in  the  paper  [1],  In  principle,  the  results  of  this  section  could  be  extended 
to  other  subdivision  rules  but  this  would  require  the  development  of  specific  properties  of 
the  completion  process  for  these  rules.  These  properties  were  developed  in  [1]  for  newest 
vertex  bisection  and  required  a  nontrivial  analysis  to  do  so. 

We  first  introduce  and  discuss  the  properties  of  newest  vertex  bisection  that  we  shall 
need.  The  proofs  of  these  properties  not  given  here  will  all  be  found  in  [1],  Given  an 
initial  admissible  partition  P0  of  Q,  we  give  each  edge  of  this  partition  a  label  of  0  or  1  in 
such  a  way  that  for  each  triangular  cell  exactly  one  of  its  edges  has  the  label  0.  That  such 
a  labelling  is  possible  is  proved  in  Lemma  2.1  of  [1].  The  edge  in  A  given  the  value  0  will 
be  denoted  by  E( A).  The  vertex  opposite  this  edge  is  called  the  newest  vertex  for  A  and 
is  denoted  by  v(A).  The  rule  for  newest  vertex  subdivision  of  a  cell  A  is  to  insert  a  new 
edge  connecting  v(A)  to  the  midpoint  of  E( A)  thus  producing  two  new  cells  A1;  A2  (the 
children  of  A).  These  two  new  cells  are  given  the  new  vertex  which  is  the  midpoint  of 
E( A).  The  three  new  sides  produced  (i.e.  the  new  diagonal  and  the  two  new  sides  arising 
from  the  bisection  of  E( A))  are  given  the  label  2.  Thus  at  this  stage  each  cell  will  have 
the  label  (f,  1,  0)  (if  it  has  not  been  subdivided)  or  (2,2,  f)  (if  it  is  a  new  cell  obtained 
by  subdivision)  and  the  newest  vertex  is  always  opposite  the  side  with  smallest  label.  In 
general,  any  cell  appearing  in  newest  vertex  bisection  has  the  label  ( k  +  1,  k  +  1,  k)  with 
newest  vertex  the  one  opposite  the  smallest  labelled  side.  When  this  cell  is  subdivided 
it  produces  two  children  (using  the  newest  vertex  bisection  rule)  which  have  the  labels 
( k  +  2,  k  +  2,  k  +  1 )  and  we  have  the  same  property  that  the  side  opposite  the  smallest 
label  is  the  newest  vertex. 

In  going  further  in  this  section,  T„  will  denote  the  master  tree  for  newest  vertex 
subdivision  as  described  above.  So  T*  is  a  binary  tree.  We  denote  as  before  Tn  the  class  of 
proper  trees  with  N(T )  =  n  and  by  Tfi  the  class  of  admissible  trees.  Thus  T  e  Tfi  means 
that  T  G  Tn  and  that  its  leaves  form  an  admissible  partition  P  (i.e.  P  has  no  hanging 
nodes).  We  have  already  alluded  to  the  following  fact: 

Remark  7.1  Any  tree  T  £  Tn  can  be  refined  to  find  a  tree  T  which  is  admissible  and 
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satisfies 


N(T)  <  C2N(T) 


(7.1) 


with  C2  an  absolute  constant. 

Proof:  In  essence,  this  is  Theorem  2.4  of  [1].  However  that  theorem  is  not  stated  in  the 
form  of  (7.1)  so  we  must  say  a  few  words  on  how  to  obtain  (7.1).  Let  A40  denote  the 
set  of  all  root  nodes  in  T*  which  are  interior  nodes  of  T  (i.e.  they  are  subdivided  in  the 
creation  of  T).  We  let  T[  denote  the  tree  obtained  by  subdividing  exactly  the  cells  in 
A4q.  We  then  denote  by  T\  the  completion  of  T[.  Next,  we  let  A4  \  denote  the  set  of  all 
leaves  of  T\  which  are  interior  nodes  of  T.  We  let  T'2  denote  the  tree  obtained  from  T\ 
by  subdividing  precisely  the  nodes  in  A4 1  and  let  T2  denote  the  completion  of  Tfi  We 
continue  in  this  way  until  the  smallest  value  of  k  where  M.k  =  0.  We  have  thus  reached  a 
tree  T  :=  Tk  which  is  admissible  and  is  a  refinement  of  T.  Theorem  2.4  of  [1]  proves  that 

#(P0)  +  N(T)  <  #(P0)  +  C{#{M0)  +  •  •  •  +  #(Mk-i))  <  #(P0)  +  C  N(T)  (7.2) 

with  C  an  absolute  constant.  Here,  in  the  last  inequality  we  used  the  fact  that  each  node 
in  is  an  interior  node  of  T  by  the  very  definition  of  the  set  A4j.  □ 

We  want  next  to  define  the  types  of  error  functional  e  we  shall  associate  to  newest- 
vertex  bisection.  To  do  this,  we  introduce  the  minimal  ring  associated  to  a  triangular  cell 
A  G  T*.  Given  any  admissible  partition  P  and  A  G  P,  we  define 

7?(A,P)  :=  UA'eP.AnAVS^V  (7.3) 

which  is  the  first  ring  about  A.  This  ring  depends  on  P.  However,  we  can  find  a  minimal 
ring  about  A  which  does  not  depend  on  P.  Namely,  we  define 

P_(A)  :=  flpP(A,P)  =  Ua'6P_(A)A'  (7.4) 

where  the  intersection  is  taken  over  all  admissible  partitions  P.  Here  the  set  P_( A)  is 
the  collection  of  cells  from  T*  which  touch  A  and  make  up  P_(  A)  .  The  structure  of  the 
set  P_( A)  has  been  given  in  Lemma  4.3  of  [1]. 

We  can  now  define  functionals  e  on  T*  for  which  we  can  develop  an  analogue  to  the 
results  of  §5.  These  functionals  start  with  a  function  norm  ||  •  ||  which  has  a  power  which 
is  set  subadditive.  Examples  are  any  of  the  Lp,  1  <  p  <  oo,  norms,  whose  p-th  power  has 
this  property,  or  any  of  the  Sobolev  Hs  =  Ws(L2(f2))  norms,  s  >  0,  whose  square  has 
this  property.  We  shall  limit  our  discussion  to  the  latter  case  since  it  is  of  most  interest  in 
Finite  Element  applications.  So,  fix  0  <  s  <  1,  and  let  ||  •  ||  denote  the  Hs(fil)  norm  with 
Q  the  polygonal  domain  corresponding  to  the  partition  P0.  We  fix  a  function  /  G  /P(Q) 
and  for  each  A  G  T„  we  define 

e(A)  :=  inf  \\f  -  S\\2Ha(R_(A))  (7.5) 

where  the  infimum  is  taken  over  all  continuous  piecewise  linear  functions  S  defined  on 
P_( A)  which  are  subordinate  to  P_( A).  We  define  E(T )  for  this  e  as  we  have  before. 
We  also  define  the  best  error  Ern  but  now  the  competition  is  restricted  to  admissible 
partitions: 

Em  :=  inf  E(T).  (7.6) 

TeTa 

1  m 
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We  shall  now  modify  the  second  algorithm  so  as  to  generate  admissible  partitions. 
Some  modifications  are  also  made  to  make  the  proof  of  the  following  theorem  proceed 
without  difficulty. 

While  we  view  our  goal  as  to  create  partitions  based  on  newest  vertex,  we  shall  actually 
use  the  following  subdivision  rule: 

NEW  SUBDIVISION  RULE:  If  A  is  a  triangular  cell,  then  it  is  subdivided  into  32 
children  which  are  obtained  by  applying  newest  vertex  bisection  5  times  uniformly  on  A. 
In  other  words,  the  children  of  this  subdivision  rule  are  all  fifth  generation  offspring  of  A 
obtained  by  the  newest  vertex  bisection  rule. 

Our  reason  for  applying  so  many  newest  vertex  bisections  in  just  one  primary  subdi¬ 
vision  of  the  NEW  SUBDIVISION  RULE  is  simply  that  it  guarantees  the  following 
property: 

Property  I:  When  applying  the  NEW  SUBDIVISION  RULE  to  a  triangular  cell  A, 
among  the  32  children  of  A  there  is  one  child  A1  whose  minimal  ring  R-(A')  of  (7.5) 
(still  defined  according  to  newest  vertex  insertion)  is  completely  contained  in  A. 

Indeed,  take  for  A'  any  child  which  does  not  touch  the  boundary  of  A  (see  marked  trian¬ 
gles  in  Figure  7.1). 


Figure  7.1:  Application  of  the  NEW  SUBDIVISION  RULE  to  a  triangle 


To  define  the  modified  second  algorithm,  we  use  the  definition  of  e  for  the  NEW 

SUBDIVISION  RULE. 

MODIFIED  SECOND  ALGORITHM:  Given  as  input  a  target  accuracy  [x,  this 
algorithm  produces  an  admissible  tree  T  satisfying 

E(T)  <  (7.7) 

as  follows.  For  j  =  0,  T0  is  the  collection  of  roots  ofT: *.  For  each  j ,  we  do  the  following: 

(i)  Compute  E(Tj).  If  E{Tj)  <  \1fC2,  with  C2  the  constant  of  (7.1)  go  to  (in).  If 
E(Tj)  >  11/C2  go  to  (ii). 

(ii)  We  examine  all  leaves  ofTj  and  subdivide,  according  to  the  NEW  SUBDIVISION 
RULE  all  of  the  leaves  A  with  largest  e(A)  to  produce  TJ+  l .  We  replace  j  by  j  +  1  and 
return  to  (i). 

(Hi)  We  define  T  as  the  completion  ofTj.  This  completion  is  made  according  to  the 
newest  vertex  bisection  completion  and  therefore  satisfies  N(T)  <  C^NifFj). 
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The  following  theorem  describes  the  optimal  properties  of  this  algorithm. 

Theorem  7.2  There  are  absolute  constants  C3,  c3  >  0  such  that  such  that  the  output  T 

of  the  MODIFIED  SECOND  ALGORITHM  satisfies 

E(T)  <  C3Em  (7.8) 

whenever  m  <  c3n  and  n  :=  N(T).  Here,  Em  is  defined  in  the  competition  among 
all  admissible  subtrees  of  the  newest  vertex  tree  T*.  To  create  T,  the  algorithm  uses 
<  Cfin  +  No)  arithmetic  operations  and  computations  of  e,  where  N0  is  the  number  of 
root  nodes  ofT *. 

It  is  clear  at  the  outset  that  the  tree  T  of  the  MODIFIED  SECOND  ALGO¬ 
RITHM  is  a  subtree  of  T*  and  it  is  admissible.  The  remainder  of  this  section  will  be 
devoted  to  proving  this  theorem  which  is  a  modification  of  the  proof  of  Theorem  5.2. 
We  let  T*  be  an  admissible  tree  with  N(T*)  =  m  which  attains  Em.  As  in  the  proof  of 
Theorem  5.2,  we  assume  (without  loss  of  generality)  that  all  root  nodes  are  interior  nodes 
(i.e.  have  been  subdivided)  in  either  T  or  T*,  and  resolve  the  trivial  case  N(T')  <  N0. 
We  shall  prove  two  inequalities: 

(a)  E(T)  <  Ct(N0  +  N(T)). 

(b)  t{N0  +  N(T))  <  CEm  if  m  <  cn 

where  the  constants  are  absolute  and  t  >  0  is  the  value  specified  as  in  the  proof  of  Theorem 
5.2.  These  two  inequalities  combine  to  give  the  theorem. 

Proof  of  (a): 

If  we  had  the  original  subadditivity  property  (5.1)  for  e,  the  proof  would  be  trivial. 
Indeed,  we  would  know  (a)  with  N(Tj )  in  place  of  N(T)  as  it  is  in  (5.14).  Then,  we  can 
replace  N(Tj)  by  N(T)  because  of  (7.1).  However,  this  subadditivity  does  not  necessarily 
hold.  Instead,  we  have  the  following  weaker  condition: 

Modified  subadditivity:  For  any  A  G  T*  and  any  admissible  finite  tree  Ta  with  single 
root  node  A,  we  have 

E  e(A')  <  C„e( A)  (7.9) 

A  ’ec(rA) 

with  C0  an  absolute  constant. 

This  property  follows  from  the  set  subadditivity  of  ||  •  ||2  and  the  fact  that  any  point 
x  G  can  appear  in  at  most  ko  of  the  minimal  rings  i?_( A'),  A'  G  £(Ta)  where  ko  is  an 
absolute  constant  (see  the  simple  proof  in  [1]  which  uses  the  fact  that  Ta  is  admissible). 

Let  us  now  see  how  to  use  this  modified  subadditivity  to  prove  (a).  Let  T  =  Tj  be 
the  final  partition  obtained  by  applying  the  MODIFIED  SECOND  ALGORITHM 
and  let  T  be  the  completion  of  T  which  is  obtained  by  additional  subdivisions.  We  know 
from  (7.1)  that 

N(T)  <  C2N(T).  (7.10) 

As  in  Theorem  5.2,  we  let  t  be  the  smallest  value  attained  by  e  on  the  interior  nodes 
of  T  and  let  A  :=  {A  :  e(A)  =  t}  be  the  collection  of  interior  nodes  of  T  where  e  takes 
on  the  value  t.  We  define  T0,T!  and  Ta,  A  G  A,  C0,  E0,  and  £|  as  in  the  proof  of 
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Theorem  5.2.  These  trees  and  sets  of  leaves  are  defined  relative  to  T  (and  not  T).  Thus 
each  of  the  trees  Ta  is  admissible.  Then  as  before, 

£(T)=  Y,  e(A>+  E  <KA)  =  S„  +  £i.  (7.11) 

Ae£0  AeUi 

Since  Ta  are  admissible  trees,  it  follows  that  for  each  A  G  A,  we  have 

]T  e(A')  <  C0e(A)  =  C0e(A)p(A)  (7.12) 

A' 6  C(TA) 

where  now  we  have  used  the  modified  subadditivity  (7.9).  We  can  continue  then  with  the 
reasoning  of  Theorem  5.2  and  arrive  at 

So  +  £i<Co  g(AMA)-  (7.13) 

Aen(To) 

In  contrast  to  Theorem  5.2,  we  do  not  know  that  e(A')  <  t,  A'  G  £(T0),  because 
of  the  completion  process.  However,  we  do  know  this  inequality  for  all  A  G  A  where  A 
consists  of  all  A  that  are  in  C(T)  but  are  not  in  Ta  for  any  A  G  A.  Indeed,  this  is  the 
same  as  in  the  proof  of  Theorem  5.2.  Now,  for  any  A  G  A,  we  let  Ty  be  the  tree  with 
single  root  A  which  consists  of  all  cells  in  T  contained  in  A.  Then,  Ty  is  admissible  and 
the  same  derivation  as  (7.12)  gives 

Y1  e(A')  2  C0e(A)p(A).  (7.14) 

A'e£(TA) 

Since  every  A'  G  £(T0)  appears  either  in  A  or  in  one  of  the  T(Ty),  A  G  A,  we  obtain 
using  (5.6) 

Y1  e(A')  <  Co  Y,  e(A)jp(A)  +  £  e(A)p(A)  <  C0t(N„  +  1 V(f )).  (7.15) 

A'eU(To)  AeA  AeA 

This  gives  (a)  because  of  (7.11),  (7.13)  and  the  fact  that  N(T)  <  N(T).  □ 

Proof  of  (b): 

We  shall  follow  the  same  line  of  proof  as  in  Theorem  5.2  until  we  run  into  a  difficulty 
and  then  we  shall  show  how  to  overcome  that  difficulty.  The  proof  uses  T  rather  than  T 
in  the  constructions.  We  let  C*  again  denote  the  set  of  leaves  of  jC(T*)  which  are  interior 
nodes  of  T.  For  each  A*  G  A*  we  define  Ta*  as  the  set  of  all  A  G  T  \  C(T )  which  are 
contained  in  A*.  In  Theorem  5.2  we  showed  that 

tM<C0  e(A*)  A  Co Em  (7.16) 

A  *e£* 

where 

M:=  N(Ta*)-  (7-17) 

A  *e£* 

We  shall  show  this  again  with  a  new  constant  on  the  right  side  of  (7.16)  but  we  need  a 
different  proof  because  we  do  not  have  the  subadditivity  used  in  (5.15). 
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Lemma  7.3  For  each  A*  G  jC*  ,  we  have 


tN(TA .)  <  C5e(A*) 


(7.18) 


with  C5  an  absolute  constant. 

Proof:  In  the  case  that  {A*}  =  £(Ta*),  we  have  N(TA*)  =  0  and  (7.18)  holds  trivially. 
So  in  going  further  we  assume  N(TA> )  >  1  which  means  that  the  32  children  of  A*  from 
the  NEW  SUBDIVISION  RULE  are  in  the  interior  of  f .  Let  A  G  £(TA.)  and  let  A 
be  its  parent  relative  to  the  NEW  SUBDIVISION  RULE  used  to  create  T.  Recall 
that  e  is  constant  on  each  of  the  32  children  of  A.  From  Property  I,  we  can  choose 
one  of  these  children  A'  which  is  strictly  contained  in  the  interior  of  A  and  for  which 
-R-(A')  C  A.  Let  us  denote  by  A'  the  set  of  all  such  A'  that  we  have  created.  We  note 
that  for  each  pair  Ai,  A2  in  A'  the  sets  R-(Ai)  fl  i?_(A2)  has  measure  0.  Indeed  the  two 
parents  Ai,  A2  of  these  cells  are  disjoint  (for  this  property  one  is  using  the  fact  that  when 
Ax  is  subdivided  so  are  all  of  its  siblings).  Thus,  from  the  set  subadditivity  of  ||  •  ||2  we 
obtain 

t#(A')<  e(A')<e(A*)  (7.19) 

A'eA' 

where  we  have  used  the  fact  that  e(A')  >  t,  A'  G  A',  because  these  nodes  are  interior 
to  T.  Now  since  #(A')  =  #(£(Ta*)/32  and  N(TA*)  <  2#(C(TA*))  (see  (3.1)  and  (3.2)), 
(7.18)  follows  from  (7.19)  with  C5  =  64.  □ 

Having  established  (7.18),  we  can  sum  over  all  A*  in  C*  and  derive  (7.16)  with  a  new 
constant  in  place  of  Co- 

We  can  conclude  the  proof  of  Proof  of  (b)  as  in  Theorem  5.2.  Namely,  let  T'  be 
the  tree  obtained  from  T  by  deleting  all  of  the  leaves  of  T.  Then,  T'  contains  TA*  for  all 
A*  G  C* .  As  in  (5.20),  we  obtain  (here  we  have  K  =  32) 

N(T)  <  (I<  +  l)V(T')  and  N0  +  N(T)  <  ( K  +  2 )N(T').  (7.20) 

Finally,  we  claim  that 

N(T')  <  CM  (7.21) 

with  C  an  absolute  constant,  which  will  complete  the  proof.  To  see  (7.21),  we  note  that 
the  only  interior  nodes  of  T'  that  are  not  interior  nodes  in  one  of  the  TA*  are  those  which 
are  interior  nodes  of  T*.  This  means  that  there  are  at  most  N(T*)  <  rn  of  them.  Hence, 

N(T')  <M  +  m<M  +  N(T)/(2K  +  2)  <  M  +  N(T') /2  (7.22) 

where  we  have  used  (7.20)  and  the  inequality 

N(T)  >  C^NiT)  =  C^n  >  2 (K  +  1  )m  (7.23) 

which  holds  provided  C3  is  chosen  smaller  than  C^1  /(2AT  +  2).  This  completes  the  Proof 
of  (b)  and  completes  the  proof  of  Theorem  7.2.  □ 

Finally,  we  remark  that  the  properties  of  the  MODIFIED  SECOND  ALGO¬ 
RITHM  can  also  be  described  in  the  form  of  Corollary  5.4. 
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Corollary  7.4  There  are  absolute  constants  Cl,c1>  0  such  that  for  any  error  tolerance 
H  >  0,  the  tree  produced  by  the  MODIFIED  SECOND  ALGORITHM  has  the 
properties: 

(i)  If  T  C  T*  is  any  tree  satisfying  E(T )  <  cip,  then 

<  C^NiT).  (7.24) 


(ii)  The  number  of  evaluations  of  e  and  the  number  of  arithmetic  operations  in  producing 
T u  is  <  Ci(N0  +  N(T^)). 

Proof:  The  proof  of  this  corollary  is  the  same  as  that  of  Corollary  5.4.  □ 
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